Explicit Baseball Event Detection by Combining Visual and Speech Information
نویسنده
چکیده
To explicitly detect baseball events, a multimodal approach that combines the decisions of visual and speech event detection is proposed. We respectively perform event detection from visual and speech perspectives and estimate the corresponding confidences. By combining the decisions from different modalities, the detection performance increases by 8% ~ 20%, in terms of F1 metric. This work achieves explicit event detection in baseball videos and facilitates the development of realistic applications for management or entertainment.
منابع مشابه
Joint processing of audio and visual information for multimedia indexing and human-computer interaction
Information fusion in the context of combining multiple streams of data e.g., audio streams and video streams corresponding to the same perceptual process is considered in a somewhat generalized setting. Speci cally, we consider the problem of combining visual cues with audio signals for the purpose of improved automatic machine recognition of descriptors e.g., speech recognition/transcription,...
متن کاملSpeech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کاملOnline multiple people tracking-by-detection in crowded scenes
Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...
متن کامل(1) Explicit Baseball Event Detectionintegration of Rule-based and Model-based Methods for Baseball Event Detection,"
In this paper, we use an associative memory based on SOM to memorize music data. The goal is to structure a learning system which can deal with complex music. However now, we build the memory system, and input some music data. The system will be self-connected. Then, we observe the connection of memory nodes, and we will fix the memory system based on those observations. Then we describe my mem...
متن کاملNon - Speech Acoustic Event Detection Using
Non-speech acoustic event detection (AED) aims to recognize events that are relevant to human activities associated with audio information. Much previous research has been focused on restricted highlight events, and highly relied on ad-hoc detectors for these events. This thesis focuses on using multimodal data in order to make non-speech acoustic event detection and classification tasks more r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006